Overview of machine learning

نویسنده

  • Kevin P. Murphy
چکیده

The most widely studied problem in machine learning is supervised learning. We are given a labeled training set of input-output pairs, D = (xi, yi)i=1, and have to learn a way to predict the output or target ỹ for a novel test input x̃ (i.e, for x̃ 6∈ D). (We use the tilde notation to denote test cases that we have not seen before.) Some examples include: predicting if someone has cancer ỹ ∈ {0, 1} given some measured variables x̃; predicting the stock price tomorrow ỹ ∈ IR given the stock prices today x̃; etc. A common approach is to just predict one’s “best guess”, such as ŷ(x̃). However, we prefer to compute a probability distribution over the output, p(ỹ|x̃), since it is very useful to have a measure of confidence associated with one’s prediction, especially in medical and financial domains. In addition, probabilistic methods are essential for unsupervised learning, as we discuss in Section 3. If y is discrete or categorical, say y ∈ {1, 2, . . . , C}, this problem is called classification or pattern recognition. If there are C = 2 classes or labels, the problem is called binary classification (see Figure 1 for an example), otherwise it is called multi-class classification. We usually assume the classes are mutually exclusive, so y can only be in one possible state. If we want to allow multiple labels, we can represent y by a bit-vector of length C, so yj = 1 if y belongs to class j. If y is continuous, say y ∈ IR, this problem is called regression. If y is multidimensional, say y ∈ IR , we call it multivariate regression. If y is discrete, but ordered (e.g., y ∈ {low,medium,high}), the problem is called ordinal regression. A priori, our prediction might be quite poor, but we are provided with a labeled training set of input-output pairs, D = (xi, yi) n i=1, which provides a set of examples of the “right response” for a set of possible inputs. If each input

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Overview of Vocabulary Learning Strategies in English as a Foreign Language

Researchers in the area of EFL learning have tried to put the way(s) by which EFL learners learnEnglish vocabulary into some frames and present them as strategies. This paper reviewsdescriptive research on vocabulary learning strategies in English as a foreign language. Thereview focuses on common strategies that learners use in vocabulary learning such as dictionarystrategies, note-taking stra...

متن کامل

Machine learning algorithms in air quality modeling

Modern studies in the field of environment science and engineering show that deterministic models struggle to capture the relationship between the concentration of atmospheric pollutants and their emission sources. The recent advances in statistical modeling based on machine learning approaches have emerged as solution to tackle these issues. It is a fact that, input variable type largely affec...

متن کامل

Comparative Analysis of Machine Learning Algorithms with Optimization Purposes

The field of optimization and machine learning are increasingly interplayed and optimization in different problems leads to the use of machine learning approaches‎. ‎Machine learning algorithms work in reasonable computational time for specific classes of problems and have important role in extracting knowledge from large amount of data‎. ‎In this paper‎, ‎a methodology has been employed to opt...

متن کامل

Overview of learning theories and its applications in medical education

Introduction: The purpose of teaching is learning, and learning is related to learning theories. These theories describe and explain how people learn. According to various experts' opinion about learning, many theories emerged. The paper reviewed three major approaches include behaviorism, cognitive and constructive learning and its educational applications in medical science. Methods: this pa...

متن کامل

Machine Learning and Citizen Science: Opportunities and Challenges of Human-Computer Interaction

Background and Aim: In processing large data, scientists have to perform the tedious task of analyzing hefty bulk of data. Machine learning techniques are a potential solution to this problem. In citizen science, human and artificial intelligence may be unified to facilitate this effort. Considering the ambiguities in machine performance and management of user-generated data, this paper aims to...

متن کامل

Machine learning based Visual Evoked Potential (VEP) Signals Recognition

Introduction: Visual evoked potentials contain certain diagnostic information which have proved to be of importance in the visual systems functional integrity. Due to substantial decrease of amplitude in extra macular stimulation in commonly used pattern VEPs, differentiating normal and abnormal signals can prove to be quite an obstacle. Due to developments of use of machine l...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007